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(57) Abstract: The present invention relates to 
a method of detecting blocking artifacts in dig- 
ital video pictures. The detection method com- 
prises a step of filtering (GF) a digital input sig- 
nal (x) using a gradient filter for providing at 
least one filtered signal and a step of calculating 
(CALC) a block level metric (BM) for process- 
ing the filtered signa](s) to identify and count 
blocking artifacts as a function of their position 
in a grid. If the block level metric (BM) is lower 
than a threshold, the picture has either not been 
encoded using a block-based processing, or has 
been encoded in a seamless way. In the oppo- 
site case, the picture has been encoded using a 
block-based processing in a non-seamless way 
and corrective actions* such as a post-process- 
ing (PP), can be taken. 



< 

o 
O 



CALC 




wo 01/20912 



1 



PCT/EPOO/08497 



METHOD AND DEVICE FOR IDENTIFYING BLOCK ARTIFACTS IN DIGITAL VIDEO PICTURES 



The present invention relates to a method and its corresponding device for 
detecting blocking artifacts in digital video pictures. 

The present invention also relates to a method and its corresponding device for 
processing a sequence of digital video pictures comprising a detection step of blocking 
5 artifacts and a post-processing step. 

The present invention further relates to a set-top-box and a television set 
comprising such devices. 

10 Video sequences encoded with existing international video encoding standard 

can sometimes present some degradations, such as blocking artifacts. The commonly 
encountered degradations can go from very little impairments to heavy degradation 
depending on the encoding bit rate. Several methods of measuring the blocking artifact level 
have already been introduced. Based on the human visual sensitivity, said methods require 

15 both the original and the reconstmcted images and are rather complex to implement. As a 
consequence, they cannot be used when the original pictures are not available. 

To solve this problem, a new method is disclosed in the paper "Quantitative 
quality metrics for video coding blocking artifacts" by H.R. Wu and M, Yuen in Proceedings 
of Picture Coding Symposium, vol. 1, pp. 23-28, March 1996. This method uses only the 

20 reconstructed video pictures to determine a block level metric. Unfortunately, the block level 
metric calculation is very complex in terais of number of operations and of memory 
requirements, making it unrealistic for an implementation in a commercial product. 
Moreover, this method assvmaes that the first encoding block starts at the top right pixel of the 
picture, which is not always true if said picture has been converted to analog before being 

25 converted to digital. 

It is an object of the invention to provide a method of detecting blocking 
artifacts contained in digital video pictures, which processes video pictures without an a 

I 
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priori knowledge of original pictures or any information related to the encoding process, and 
which can be easily implemented in a hardware application. 

To this end, the method according to the invention is characterized in fliat it 

comprises: 

5 - a step of filtering a digital input signal using a gradient filter for providing at least one 
filtered signal, 

- a step of calculating a block level metric indicating if the picture has been encoded or not 

using a block-based processing, for processing the filtered signal(s) to identify and count 

blocking artifects as a fimction of their position in a grid. 
10 Such a method can detect blocking artifacts with an efficient and simple 

algorithm that only needs the reconstracted pictures. If the computed block level metric is 

lower than a threshold, the picture has either not been encoded using a block-based 

processing, or has been encoded in a seamless way. In the opposite case, the picture has been 

encoded using a block-based processing in a non-seamless way. 
1 5 The method according to the invention is also characterized in that the 

calculation step comprises a sub-step of detemiining a shift of an origin of the grid in the 

picture in order to compute the block level metric. 

Said method does not assume that the first encoding block starts at the top 

right pixel of the digital video picture. As a consequence, said method can be implemented 
20 directly in a television set, without knowing in advance if the incoming picture has been 

previously converted firom digital to analog and flien to digital again. 

It is another object of flie invention to provide a method of processing a 

sequence of digital video pictures comprising this step of detecting blocking artifects and a 

step of post-processing the digital video pictures if the block level metric provided by the 
25 detection step is higher than a threshold. 

Such a processing method benefits tmm the block level metric computed in 

the detection step in order to take the right corrective actions and, consequently, to adapt in a 

suitable way the post-processing step. 

Finally, it is an object of the invention to provide a device implementing such 
30 a detection method. Such a device will be advantageously integrated into set-top-boxes or 

into up-market television sets. 

These and other aspects of the invention will be apparent firom and elucidated with reference 
to the embodiments described hereinafter. 
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The present invention will now be described, by way of example, with 
reference to the accompanying drawings, wherein : 

Fig. 1 is a block diagram of an MPEG block detector according to the 

5 invention, 

Fig. 2 represents the first column of an MPEG block and its two nearest 

neighbors, and 

Fig. 3 is a flowchart for the process used to perform the identification of 
blocking artifacts. 

10 

The present invention proposes a new method of detecting blocking artifacts 
contained in digital video pictures. Such a method comprises two major steps, as illustrated in 
the block diagram of figure 1, The first one is a step of gradient filtering (GF), the second one 
15 is a step of calculating a block level metric (CALC). 

This method has been developed for MPEG applications, especially for 
broadcasting applications, but also remains valid for applications using a block-based 
processing for motion estimation, and a discrete cosine transform (DCT) such as, for 
example, H.261 or H.263 of the International Telecommunication Union (ITU). 

20 



In the preferred embodiment, the detection method uses the luminance 
component of the video signal, but it is also possible to use the chrominance components of 
said video signal. This method is successively applied to each field of a picture in the case of 

25 an interlaced sequence of pictures, or directly to a firame in the case of a progressive 
sequence. Moreover, in order to save memory cost, only half a field is scanned in the 
horizontal direction instead of the whole field. For this purpose, an active window (AW), 
having a length of 360 pixels and a height of 288 pixels in fiiU-fonnat encoding (i.e. the 
encoding picture is 720x576 pixels in said format), is positioned in the field in order to select 

30 a portion of said field, giving a re-sized video signal (x) fi-om the luminance signal (y) 

corresponding to the whole field. Anyway, the dimensions of the active window (AW) can be 
modified depending on the method accuracy or the memory allocation required by the user. 
The active window (AW) proposed in the invention is a good trade-off between these two 
parameters, because it divides the memory cost by two without a significant degradation of 
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the results given by the detection method. Said method can also be improved by changing the 
position of the active window (AW) for each field. In the preferred embodiment, the active 
window (AW) is put on the left side of the odd field and on the right side of the even field. 
This implementation is both simple and efficient, but other implementations are possible such 
5 as, for example, to take a random active window (AW) for each field. 



10 



15 



The re-sized video signal (x) is filtered using the gradient filtering (GF) step. 
To this end, a high-pass filter hi=[-l 1] is applied in both horizontal and vertical directions, 
giving respectively a horizontal filtered pixel array (Xh) and a vertical filtered pixel array (Xv). 
Other gradient filters can be used for this ^plication such as, for examples, another one- 
dimensional filter h2=[-l 0 1] or a two-dimensional filter ha, called the Sobel filter, which is 
defined as follows: 



h3 = 



-1 0 1 
-2 0 2 
-10 1 



20 



The gradient filter hi used in the preferred embodiment has been chosen for its 
higji sensitivity and its low complexity. 

In another embodiment, the gradient filtering step is performed in only one 
direction, either the horizontal one or the vertical one, giving respectively only a vertical or a 
horizontal blocking artifact detection, but also leading to a lower efficiency of the detection 
method. 



A calculation step (CALC) is then performed on the two arrays of pixels (xh 
and Xv), this calculation step comprising three sub-steps. 
25 During the first sub-step (ABS), the arrays of tiie absolute values of the 

horizontal and vertical filtered pixels are built. 

Then, in the second sub-step (AV), the average of the absolute values obtained 
in the first sub-step is computed over tiie field for both horizontal and vertical arrays. 

Finally, the third sub-step (ID) consists of the identification of blocking 
30 artifacts fi-om the previously computed values of the first and second sub-steps. 

The result of the calculation step (CALC) is a blocking artifact level metric 
(BM) for each field of a sequence of pictures. Depending on the value of said metric (BM), a 
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post-processing step (PP) is either or not applied to the incoming video signal (y), giving a 
filtered signal (yf). 

5 The followin g notations are xised in the present document: 

y[ij] is the luminance array of the incoming field, i being the line index and j 
being the column index of said field, numbered fi^om 0, 

x[ij] is the luminance array corresponding to the re-sized video signal, i and j 
still being the line index and the column index of the incoming field, 
1 0 Xh[i J] and Xv[i j] are the luminance arrays after the horizontal and vertical 

gradient filtering step applied to x[ij], 

xah[ij] and xav[ij] are the arrays containing the absolute values of the filtered 
pixels constituting respectively Xh[iJ] and Xv[iJ], 

xa^^ and xa7 are the averages of respectively xah[ij] and xav[ij] over the 
15 portion of the field corresponding to the active window (AW). 

Blocking artifacts are the result of DCT-block quantization. They occur at the 
boundary of MPEG blocks. To determine if a blocking artifact is present on a particular block 

20 boundary, the characteristics of the filtered arrays xah[i j] and xav[i J] are investigated. 
Horizontal blocking artifacts are detected in the vertically filtered array xav[ij], whereas 
vertical blocking artifacts are detected in the horizontally filtered array xah[ij]. A blocking 
artifact is found if the absolute values of the eight filtered pixels xah[ij] to xah[i+7 j] 
belonging to a block boundary are noticeably greater than their neighbors. Figure 2 represents 

25 the first colimm of an MPEG block xah[ij] to xah[i+7 j] and its two nearest neighbors. A 
vertical blocking artifact is detected by the invention if the two following conditions are 
fiilfilled between columns of the horizontally filtered array xah[ij]: 

xah[nj]> xajnj - 1]+ ^ 

Jl_ Vn€[l,i + 7j 

x^h ["/ j] > xa h [n J + 1] + ^ 
The same operation is performed between lines of the vertically filtered array 

30 xav[ij]: 
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xav[i,m]>xav[i-l,m]+— ^ 

_ Vm e [j J + k - 1] with k = 8, 10 or 12 

xa V [i/ m] > xa V [i + 1/ m ] + 

The size of the grid 8xk, corresponding to the area of investigation, depends 
on the MPEG block size and, as a consequence, on the encoding format. Due to the encoding 
formats mainly used by broadcasters, different grid sizes are possible such as 8x8, 8x10, 
8x12. However, it will be apparent to a person skilled in the art that the invention is not 
limited to the block of such sizes. 

In the preferred embodiment, the horizontal grid size k is determined by 
computing the distance count_grid between a current blocking artifact and the previous one. 
If the value of the vertical counter count_VO-l] is strictly higher than a threshold, which is 
equal to 3 in this embodiment, and if the distance count^grid is equal to 8, the value of a 
coxmter grid_8 is incremented by one; or if the value of the vertical counter count_V[i-l] is 
strictly higher than the threshold and if the distance count_grid is equal to 10, the value of a 
counter grid_10 is incremented by one; or if the value of the vertical counter count_V[j-l] is 
strictly higher than the threshold and if the distance count_grid is equal to 12, the value of a 
coimter grid_12 is incremented by one. Once the field has been processed, the horizontal grid 
size of 8, 10 or 12 corresponding to tiie greater counter among grid_8, grid_10 and grid_12 
counters is selected. The selection is validated if the same results have been found for the 
four previous fields. Moreover, the value of the horizontal grid size k must be initialized for 
the first field, for example to 10. 



Figure 3 is a flowchart that describes more precisely the algorithm used to 
perform the identification of blocking artifacts in a field. 

The blocking artifact identification method is described here for the horizontal 
array giving a vertical artifact characterization. The same algorithm is appUed to the vertical 
array giving a horizontal artifact characterization then. 

The scanning process starts at the top-left of the field and with an initialization 
to zero of the parameters used in the algorithm (ST). Then, the field is scanned line by line 
down to the bottom-right of the field and, for each pixel of coordinates (ij) belonging to the 
re-sized video signal (x), the following tests are performed. 
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7 

The values of xah[i j-2], xah[ij-l], xah[ij] and xah (respectively xav[i-2 j], 
xav[i-l J]) xav[ij] and xa^ for the horizontal artifact characterization) are first downloaded 
(LX). For reasons of implementation, the value of xaj^ is the value computed for the previous 
field. 

5 A first test (C 1 ) is performed on the downloaded values. The result of the test 

is trae (Yl) if the two following conditions are fulfilled: 

xahIU-l]-xaJu]>i^|5J. 

xah[i/j-l]-xah[U-2]>^ 

In that case (Yl), a vertical counter count__V (respectively count_H for the 

horizontal artifact characterization) is incremented by one (INC) for the column j-1 

10 (respectively for the line i-1); in the opposite case (Nl), a second test (CI) is performed on 

the value of the vertical counter. The result of the second test is true (Y2) if the two following 

conditions are fulfilled: 

fcount^VU-l] ^8 

[count _ vy - 1] < contour _ V 

where contour_V is the number of vertical consecutive pixels above which the 

15 algorithm decides that a vertical contour has been detected. In the preferred embodiment, the 

value of contour_V is set to 16 pixels, whereas the value of contour_H corresponding to a 

horizontal contour detection, is set to 3k pixels. 

If the second test (C2) is satisfied (Y2), a coefficient artifact_coxmt[p,q] of an 

array artifact_coimt corresponding to the grid of investigation whose dimensions are 8xk, is 

20 incremented by one (INCA). Then, the vertical coimter is decremented by one (DEC). The 

values of p and q are the following: 

p = (l- count _v|j-l])%8 
q = (j-l)%k 

where the result of the operation a%b is the rest of the division of a by b. 
The incrementation (INCA) and decrementation (DEC) operations are 
25 followed by a third test (C3) and are repeated while the third test is not satisfied (N3), that is 
while count_V|j-l] > 8. 

If the second test (C2) is not satisfied (N2) or if the third test (C3) is satisfied 
(Y3), the vertical counter count_V[j-l] is set to zero (INI). 
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After the incrementation step (INC) or the re-initialization step (INI), a fourth 
and last test (C4) is performed. If the end of the field has not been reached (N4), the scanning 
process (SC) goes on and the next values of the arrays xah[ij] are downloaded. In the 
contrary case, the value of a blocking artifact level metric (BM) is computed as follows: 



^ 7 k-l 

BM = artifact_a)unt[0,0]- — J^^artffact^countlU 



:0 



The calculation of the blocking level metric (BM) value has been described 
assuming that the blocking artifacts detection starts at position (0,0). Such a calculation step 
can be implemented in a set-top-box just after the decoding process. But to be implemented 
10 in a television set, some modifications concerning the blocking level metric calculation have 
to be done because we have no more hypothesis on the MPEG grid origin in this particular 
case, as the video has been converted from digital to analog and then to digital again. In this 
second embodiment, the blocking level metric (BM) is computed as follows: 



1 ^ . 

BM = artifact _ count[shift . row, shift _ column] - -jj- XI S ^^^^ - count[i, j] 

1=0 j=0 

artifact _ count[shift _ row, shift _ column] - gj^ X S - count[i, j] 



16 



15 where shift_row and shift_column are such that 

artifact _count[shift_ row, shift, column] = nJJ^^^^ (arb'fact.count[ij]) 

and where IND is a consistency variable that is incremented by one if two successive fields 
have the same grid origin and decremented by one in the other case without being negative or 
higher than 15. However, the grid position must not take into account the vertical grid shift 

20 shift_row as described above, which is only valid for a field, but the vertical grid shift 

shift_row_frame corresponding to a firame. The vertical grid shift of a fi-ame shift_row_fi^e 
is computed from the vertical grid shift of a current field shift_row and the one of the 
previous field last_shift_row as follows: 

shift_row_fiame = (shift_row -h last_shift_row) % 8. 

25 The consistency variable (IND) is an indicator of the stability of the grid 

position accross the successive fields. If this position is stable, that is if the consistency 
variable (IND) is greater than 5, it means that the sequence is likely to be MPEG encoded. 
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In addition to the blocking level metric (BM) calculation, the above described 
method provides the shift of the grid origin, which can be very useftd if a block-based post- 
processing needs to be applied to the incoming video signal. 



The value of the blocking artifact level metric (BM) obtained for the two 
different embodiments is finally low-pass filtered (LPF) over the sequence of pictures in 
order to achieve a better stability of the method. In the preferred embodiment, a recursive 
filter is used to perform this operation. This recursive filter allows to obtain the filtered value 
10 (BMf) of the block level metric corresponding to a field N and is defined as follows: 

BMf (n) = BMf (n ~ l) + >.(BM(n) - BMf (N - 1)) 

where X is a coefficient ensuring the stability of the process and which is 
equal to 0,1 in the preferred embodiment. 

In another embodiment, the low-pass filtering operation is performed by 

15 computing the average of the last processed fields. 

The value of the filtered blocking level metric (BMf) is finally compared to a 
first threshold. This first threshold has been determined by applying the method described 
here to several sequences of original pictures and by rounding up the highest blocking level 
metric (BM) reached for a field. If fliis value is lower than the threshold, the picture is either 

20 not MPEG encoded or is MPEG encoded in a seamless way. If this value is higher than the 
flireshold, the picture is MPEG encoded in a non-seamless way. In that second case 
corrective actions, such as a post-processing for example, can be performed in order to 
remove the artifact. The value of the first threshold depends on the size of the active window 
(AW) and on the level of degradation which has to be detected. 

25 However, there is a minimum level of degradation that can be detected, this 

level corresponding to a second threshold. Between the first and the second threshold, the 
original sequences cannot be distinguished firom slightly degraded sequences, but blocking 
artifacts that are not visible to the human eye can be strengthened, becoming visible then, by 
an automatic contrast or sharpness enhancement process. Moreover, the value of the second 

30 threshold is such that very few false detections are possible. Thanks to the results given by 
the above-described detection method, the automatic enhancement algorithms can be 
switched off or adjusted. 
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It will be obvious that the verb "comprise" does not exclude the presence of 
other steps or elements besides those listed in any claim. Any reference sign in the following 
claims should not be construed as limiting the claim. 



wo 01/20912 

CLAIMS: 



11 



PCT/EPOO/08497 



1°: AT5aSlBoa"of (ietTCti 

in that said method comprises: 

a step of filtering (GF) a digital input signal (x) using a gradient filter for providing at least 
one filtered signal, 

5 a step of calculating (CALC) a block level metric (BM) indicating if the picture has been 

encoded or not using a block-based processing, for processing the filtered signal(s) to identify 
and count blocking artifacts as a function of their position in a grid. 

2. A method of detecting blocking artifacts in digital video pictures as claimed in 
10 claim 1 characterized in that the calculation step (CALC) comprises a sub-step of 

determining a shift of an origin of the grid in the picture in order to compute the block level 
metric (BM), 

3. A method of detecting blocking artifacts in digital video pictures as claimed in 
15 claim 1 characterized in that said method comprises a step of filtering (LPF) the value of the 

block level metric (BM) using a low-pass filter. 

4. A method of processing a sequence of digital video pictures comprising a step 
of detecting blocking artifacts as claimed in claim 1 and a step of post-processing (PP) the 

20 digital video pictures if the block level metric (BM) provided by the detection step is higher 
than a threshold. 

5. A device for detecting blocking artifacts in digital video pictures characterized 
in that said device comprises: 

25 - means for filtering (GF) a digital input signal (x) using a gradient filter intended to provide 
at least one filtered signal, 

- means for calculating (CALC) a block level metric (BM) indicating if the picture has been 
encoded or not using a block-based processing, intended to process the filtered signal(s) to 
identify and count blocking artifacts as a fimction of their position in a grid. 
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6. A device for detecting blocking artifacts in digital video pictures as claimed in 
claim 5 characterized in that the calculation means comprises means for determining a shift 
of an origin of the grid in the picture in order to compute the block level metric (BM). 

7. A device for detecting blocking artifacts in digital video pictures as claimed in 
claim 5 characterized in that said device comprises means for filtering (LPF) the value of the 
block level metric (BM) using a low-pass filter. 

8. A device for processing a sequence of digital video pictures comprising means 
for detecting blocking artifacts as claimed in claim 5 and means for post-processmg (PP) the 
digital video pictures if the block level metric (BM) provided by the detection step is higher 
than a threshold. 

9. A set-top-box comprising a device for detecting blocking artifacts as claimed 
in any of claims 5 to 7. 

10. A television set comprising a device for detecting blocking artifacts as claimed 
in any of claims 5 to 7. 

11. A computer program product for a set-top-box that comprises a set of 
instructions, which, when loaded into the set-top-box, causes the set-top-box to carry out the 
detection method as claimed in claims 1 to 3. 

12. A computer program product for a television set that comprises a set of 
instmctions, which, when loaded into the television set, causes the television set to carry out 
the detection method as claimed in claims 1 to 3. 
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